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SEQUENCE LISTING 



(1) GENERAL INFORMATION : 

(i) APPLICANTS i Seed, Brian et al. 

(ii) TITLE OF INVENTION: Targeted Cytolysis of HIV-infected Calls by 

Chimeric CD4 Receptor-Bearing Cells 

(iii) NUMBER OF SEQUENCES t 27 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson 

(B) STREET: 225 Franklin Street 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA 
<F> ZIP: 02110-2804 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3\5" Diskette, 1.44 Mb 

(B) COMPUTER: IBM PS/2 Model 50Z or 55SZ 

(C) OPERATING SYSTEM: IBM P.C. DOS (Version 3.30) 

(D) SOFTWARE: Wordperfect (Version 5.0) 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/847,566 

(B) FILING DATE: March 6, 1992 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/665,961 

(B) FILING DATE: March 7 f 1991 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Clark, Paul T. 

(B) REGISTRATION NUMBER: 30,162 

(C) REFERENCE/ DOCKET NUMBER: 00786/212001 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 542-5070 

(B) TELEFAX: (617) 542-8906 
(C) TELEX: 200154 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1728 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NOtl: 

ATOAACCGGGT GAGTCCCTTT TAGGCACTTG CTTCTGGTGC TGCAACTGGC 50 

GCTCCTCCCA GCAGCCACTC AGGGAAACAA AGTOOTOCTO GGCAAAAAAG 100 

GGGATACAGT GGAACTGACC TGTACAGCTT CCCAGAAGAA GAGCATACAA 150 

TTCCACTGGA AAAACTCCAA CCAGAIAAAG ATTCTGGGAA ATCAGOGCTC 200 

CTTCTTAACT AAAGGTCCAT CCAAGCTGAA TGATCGCGCT GACTCAAGAA 250 

GAAGCCTTTO GGACCAAGGA AACTTCCCCC TGATCATCAA. GAATCTTAAG 300 

ATAGAAGACT CAGATACTTA CATCTGTGAA GTGGAGGACC AGAAGGAGGA 350 

GGTGCAATTG CTAGTGTTCG GAOTGACTGC CAACTCTGAC ACCCACCTGC 400 

TTCAGGGGCA GAGCCTGACC CTGACCTTGG AGAGCCCCCC TGGTAGTAGC 450 

CCCTCAGTGC AATGTAGGAG TCCAAGGGGT AAAAACATAC AGGGGGGGAA 500 

GACCCTCTCC GTGTCTCAGC TGGAGCTCCA GGATAGTGGC ACCTGGACAT 550 

GCACTGTCTS GCAGAACCAG AAGAAGGTGG AGTTCAAAAT AGACATCGTG 600 

GTGCTAGCTT TCCAGAAGCC CXCCAGCATA GTCTATAAGA AAGAGGGGGA 650 

ACAGGTGGAG TTCTCCTTCC CACTCGCCTX TACAGTTGAA AAGCTGACGG 700 

GCAGTGGCGA GCTGTGGTGG CAGGCGGAGA GGGCTTCCTC CTCCAAGTCT 750 

TGGATCACCT TTGACCTGAA GAACAAGGAA GTGTCTGTAA AACGGGTTAC 800 

CCAGGACCCr AAGCTCCAGA TGGGCAAGAA GCTCCCGCTC CACCTCACCC 850 

TGCCCCAGGC CTTGCCTCAG TATOCTGOCT CTGGAAACCT CACCCTGGCC 900 

CTTGAAGCGA AAACAGGAAA GTTGCATCAG GAAGTGAACC TGGTGGTGAT 950 

GAGAGCCACT CAGCTCCAGA AAAATTTGAC CTGTGAGGTO TGGGGACCCA 1000 

CCTCCCCTAA GCTGATGCTG AGCTTGAAAC TGGAGAACAA GGAGOCAAAG 1050 

GTCTCGAAGC GGGAGAAGCC GGTGTGGGTO CTGAACCCTG AGGCGGGGAT 1100 

GTGGCAGTCT CTGCTGAGTG ACTCGGGACA GGTCCTGCTG GAATCCAACA 1150 

TCAAGGTTCT GCCCACATGG TCCACCCCGG TGCACGCGGA TCCCAAACTC 1200 

TGCTACTTGC TAGATGGAAT CCTCTTCATC TACGGAGTCA TCATCACAGC 1250 

CCTGTACCTG AGAGCAAAAT TCAGCAGGAG TGCAGAGACT GCTGCCAACC 1300 

TGCAGGACCC CAACCAGCTC XACAATGAGC TCAATCTAGG GCGAAGAGAG 1350 

GAATATGACG TCTTGGAGAA GAAGCGGGCT CGGGATCCAG AGATGGGAGG 1*00 

CAAACAGCAG AGGAGGAGGA ACCCCCAGGA AGGCGTATAC AATOCACTGC 1^50 

AGAAAGACAA GATGCCAGAA GCCTACAGTG AGATCGCCAC AAAAGGCGAG 1500 
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AGGCGGAGAG GCAAGGGGCA CGATQGCCTT TACCAGGACA GCCACTTCCA 1550 

AGCAGTGCAG' TTCGGGAACA GAAGAGAGAG AGAAGGTTCA GAACTCACAA 1600 

GGACCCTTGG GTTAAGAGCC CGCCCCAAAG GTGAAAGCAC CCAGCAGAGT 1650 

AGCCAATCCT GTGCCAGCGT CTTCAGCATC CCCACTCTGT GGAGTCCATO 1700 

GCCACCCAGT AGCAGCTCCC AGCTCTAA 1728 



(2) INFORMATION FOR SEQ ID NO: 2 S 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1389 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



ATGAACCGGG GAGTCCCTTT TAGGCACTTG CTTCTGGTGC TGCAACTGGC 50 

GCTCCTCCCA GCAGCCACTC AGGGAAACAA AGTGGTGCTG GGCAAAAAAG 100 

GGGATACAGT GGAACTGACC TGTACAGCTT CCCAGAAGAA GAGCATACAA 150 

TTCCACTGGA AAAACTCCAA CCAGAIAAAG ATTCTGGGAA ATCAOGGCTC 200 

CTTCTTAACT AAAGGTCCAT CCAAGCTGAA TGATCGCGCT GACTCAAGAA 250 

GAAGCCTTTG GGACCAAGGA AACTTCCCCC TGATCATCAA GAATCTTAAG 300 

ATAGAAGACT CAGATACTTA CATCTGTGAA GTGGAGGACC AGAAGGAGGA 350 

GGTGCAATTG CTAGTGTTCG GATTGACTGC CAACTCTGAC ACCCACCTGC 400 

TTCAGGGGCA GAGCCTGACC CTGACCTTGG AGAGCCCCCC TGGTAGTAGC 450 

CCCTCAGTGC AATGTAGGAG TCCAAGGGGT AAAAACATAC AGGGGGGGAA 500 

GACCCTCTCC GTGTCTCAGC TGGAGCTCCA GGATAGTGGC ACCTGGACAT 550 

GCACTGTCTT GCAGAACCAG AAGAAGGTGG AGTTCAAAAT AGACATCGTG 600 
GTGCTAGCTT TCCAGAAGGC CTCCAGCATA GTCTATAAGA AAGAGGGGGA 650 
ACAGGTGGAG TTCTCCTTCC CACTCGCCTT TACAGTTGAA AAGCTGACGG 700 
GCAGTGGCGA GCTGTGGTGG CAGGCGGAGA GGGCTTCCTC CTCCAAGTCT 750 
TGGATCACCT TTGACCTGAA GAACAAGGAA GTGTCTGTAA AACGGGTTAC 800 
CCAGGACCCT AAGCTCCAGA TGGGCAAGAA GCTCCCGCTC CACCTCACCC 850 
TOCCCCAGGC CTTGCCTCAG TATGCTGGCT CTGGAAACCT CACCCTGOCC 900 
CTTGAAGCGA AAACAGGAAA GTTGCATCAG GAAGTGAACC TGGTGGTGAT 950 




GAGAGCCACT CAGCTCCAGA AAAATITGAC CTGTGAGGTG TGGGGACCCA 1000 

CCTCCCCTAA GCTGATGCTG AGCTTGAAAC TGGAGAACAA GGAGGCAAAG 1050 

GTCTCGAAGC GGGAGAAGCC GGTGTGGGTG CTGAACCCTG AGGCGGGGAT 1100 

GTGGCAGTGT CTGCTGAGTG ACTCGGGACA GGTCCTGCTG GAATCCAACA 1150 

TCAAGGTTCT GCCCACATGG TCCACCCCGG TGCACGCGGA TCCGCAGCTC 1200 

TGCTATATCC TGGATGCCAT CCTGTTTTTG TATGGTATTG TCCTTACCCT 1250 

GCTCTACTGT CGACTCAAGA TCCAGGTCCG AAAGGCAGAC ATAGCCAGCC 1300 

GTGAGAAATC AGATGCTGTC TACACGGGQC TGAACACCCO GAACCAGGAG 1350 

ACAXATGAGA CTCTGAAACA TGAGAAACCA CCCCAATAG 1389 



(2) INFORMATION FOR SEQ ID NO: 3: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1599 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



ATGAACCGGG GAGTCCCTTT TAGGCACTTG CTTCTGGTGC TGCAACTGGC 50 

GCTCCTCCCA GCAGCCACTC AGGGAAACAA AGTGGTGCTG GGCAAAAAAG 100 

GGGATACAGT GGAACTGACC TGTACAGCTT CCCAGAAGAA GAGCATACAA 150 

TTCCACTGGA AAAACTCCAA CCAGATAAAG ATTCTGGGAA ATCAGGGCTC 200 

CTTCTTAACT AAAGGTCCAT CCAAGCTGAA TGATCGCGCT GACTCAAGAA 250 

GAAGCCTTTG GGACCAAGCA AACTTCCCCC TGATCATCAA GAATCTTAAG 300 

ATAGAAGACT CAGATACTTA CATCTGTGAA GTGGAGGACC AGAAGGAGGA 350 

GGTGCAATTO CTAGTGTTCG GATTGACTGC CAACTCTGAC ACCCACCTGC 400 

TTCAGGGGCA GAGCCTGACC CTGACCTTGG AGAGCCCCCC TGGTAGTAGC 450 

CCCTCAGTGC AATGTAGGAG TCCAAGGGGT AAAAACATAC AGOGGGGGAA 500 
GACCCTCTCC GTGTCTCAGC TGGAGCTCCA GGATAGTGGC ACCTGGACAT 550 
GCACTGTCTT GCAGAACCAG AAGAAGGTGG AGTTCAAAAT AGACATCGTG 600 
GTGCTAGCTT TCCAGAAGGC CTCCAGCATA GTCTATAAGA AAGAGGGGGA $50 
ACAGGTGGAG TTCTCCTTCC CACTCGCCTT TACAGTTGAA AAGCTGACGG 700 
GCAGTGGCGA GCTGTGGTGG CAGGCGGAGA GGGCTTCCTC CTCCAAGTCT 750 
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TOGATCACCT TTGACCTGAA GAACAAGGAA GTGTCTGTAA AACGGGTTAC 800 

CCAGGACCCT AAGCTCCAGA TGGGCAAGAA GCTCCCGCTC CACCTCACCC 850 

TGCCCCAGGC CTTGCCTCAG TATGCTGGCT CTGOAAACCT CACCCTGGCC 900 

CTTGAAGCGA AAACAGGAAA GTTGCATCAG GAAGTGAACC TGGTGGTGAT 950 

GAGAGCCACT CAGCTCCAGA AAAATTTGAC CTGTGAGGTG TGGGGACCCA 1000 

CCTCCCCTAA GCTGATGCTG AGCTTGAAAC TGGAGAACAA GGAGGCAAAG 1050 

GTCTCGAAGC GGGAGAAGCC GGTGTGGGTG CTGAACCCTG . AGGCGGGGAT 1100 

GTGGCAGTGT CTGCTGAGTG ACTCGGGACA GGTCCTGCTG GAATCCAACA 1150 

TCAAGGTTCT GCCCACATGG TCCACCCCGG TGCACGCGGA TCCCAAACTC 1200 

TGCTACCTGC TGGATGGAAT CCTCTTCATC TATGGTGTCA TTCTCACTGC 1250 

CTTGTTCCTG AGAGTGAAGT JTCAGCAGGAG CGCAGAOCCC CCCGCGTACC 1300 

AGCAGGGCCA GAACCAGCTC TATAACGAGC TCAATCTAGO ACGAAGAGAG 1350 

GAGTACGATG TTTTGGACAA GAGACGTGGC CGGGACCCTG AGATGGGGGG 1400 

AAAGCCGAGA AGGAAGAACC CTCAGGAAGG CCTGTACAAT GAACTGCAGA 1450 

AAGATAAGAT GGCGGAGGCC TACAGTGAGA TTGGGATGAA AGGCGAGCGC 1500 

CGGAGGGGCA AGGGGCACGA TGGCCTTTAC CAGGGTCTCA GTACAGCCAC 1550 

CAAGGACACC TACGACGCCC TTCACATGCA GGCCCTGCCC CCTCGCTAA 1599 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 575 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; protein 

(xi) SEQUENCE DESCRIPTION I SEQ ID NO: 4: 

Met Asa Arg Gl? Val Pro Phe Arg Hit Leu Leu Leu Val Leu Gin Leu 

15 10 15 

Ala Leu Leu Pro Ala Ala Thr Gin Gly Asn Lys Val Val Leu Gly Lys 

20 25 30 

Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gin Lys Lys Ser 

35 40 45 

lie Gin Phe His Trp Lys Asn Ser Asn Gin lie Lys He Leu Gly Asn 

50 55 60 

Gin Gly Ser Phe Leu Thr Lys Gly Pro Ser Lys Leu Asn Asp Arg Ala 
65 70 75 80 

Asp Ser Arg Arg Ser Leu Trp Asp Gin Gly Asn Phe Pro Leu lie He 

85 90 95 

Lys Asn Leu Lys He Glu Asp Ser Asp Thr Tyr He Cys Glu Val Cla 

100 105 HO 

Asp Gin Lys Glu Glu Val Gin Leu Leu Val Phe Gly Leu Thr Ala Asn 
115 120 125 



S.r A.p Thr Hi, Leu Leu «g oly Gl» S.r L.u Thr Leu Thr Leu Gl« 
Ser Pro Pro oly s.r Ser Pro Ser Val Oln Cy. ^ 8er Pro ^ oly 
I*. A.n II. fll„ oly Oly Ly. Thr Leu Ser v" Ser Oln Leu Olu Leu 
Oln A.p ser Oly thr Trp Thr Cy. Thr vl? Leu oln A , n Qla ™ Ly , 
Val Olu Ph. Ly. U . A , p xl . val Val Leu Al. Phe oln Ly. Al. Ser 
Ser lie v.l Tyr Ly. Ly. Glu Oly Olu oln Val olu III Ser Ph. Pro 



Leu Ala Ph. Thr Val olu Ly^ Leu Thr Oly Ser £? olu Leu Trp Trp 
Gin Ala Olu Arg Ala Ser Ser Ser Ly. Ser 4 II. Thr Ph. A.p Leu 
lOT- Asn Ly. Olu Val Ser Val Ly. Arg VaJ Thr Oln A.p Pro Ly. Leu 



Gin Met Oly Ly. Ly. L.u Pro Leu Hi. Leu Thr Leu Pro III Ala Leu 



Pro Oln Tyr Ala Oly S . r oiy,A.n Leu Thr L.u Al. "u Olu Al. Ly 



295 



Thr Oly Ly. Leu Hi. Oln olu Val Asn Leu Val III Met Arg Al. Thr 



310 315 



Oln L.u Oln Ly. A.n Leu Thr Cy. Olu V.l Trp oly Pro Thr S.r Pro 
I*. Leu Met Leu Ser Leu Ly. Leu Olu A.n Ly. Olu Al. Ly. v.l S.r 
Ly. Arg Olu Ly. Pro V.l Trp Val L.u A.n Pro Olu Al. o" M.t Trp 



ill val ^ Pro Thr ss ie; Thr Pro Val Hi * ^* *«P P " **« 



Oln Cy. Leu Leu Ser A.p Ser oIy" Oln Val Leu Leu o" S.r A.n XI. 

. a _ Leu 

Cy. Tyr Leu Leu A,p Oly He Leu Ph. U. £ oly Val xl . xl . 



390 



Ala Leu Tyr L.u Arg Ala Ly. Phe s.r Arg Ser Ala Olu Thr IS Ala 
A.n L.u Oln A.p Pro A.n Oln Leu Tyr A.n Olu Leu A.n Jeu Oly Arg 
Arg Olu Olu Tyr A, P Val Leu Olu Ly. Ly, Arg Al. Arg A.p Pro Olu 
Met Oly Oly Lys Gin Oln Arg Arg Arg Asn Pro Oil oi„ oly Val Tyr 

Asp 

Thr Ly. Oly Olu Arg Arg Arg Oly Ly. «J Hi , A.p Oly Leu £r Oln 



Asn Ala Leu Oln Ly. Asp Ly. Met Pro Olu Jla Tyr S.r Olu II. 2J 



490 



Asp Ser His Phe Gin Ala Val Gin Phe Oly Asn a«* *~ «i° . 

515 52Q 7 A8a Ar 9 Arg Glu Arg Olu 



520 



Gly Ser Olu Leu =hr Arg Thr Leu Oly r« Arg Ala £J prQ Ly , Wy 



535 



Glu Ser Thr Oln Gin Ser Ser Oln Ser Cy, Al. Ill v.l Ph. Ser II. 



550 



Pro Thr Leu Trp Ser Pro Trp Pro Pro Ser III ser Ser Gin Leu 
565 570 575 

(2) INFORMATION TOR SEQ ID NO: 5: 

(i) SEQUEHCE CHARACTERISTICS: 

(A) LENGTH: 462 amino acids 



560 



(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: proteia 

(xi) SEQUENCE DESCRIPTION SEQ ID HO: St 

Met Asa Arg Gly Val Pro Phe Arg Hi* Leu Leu Leu Val Leu Ola Leu 

15 10 15 

Ala Leu Leu Pro Ala Ala Thr Gla Gly Asa Lys Val Val Leu Gly Lys 

20 25 30 

Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gla Lya Lys Ser 

35 40 45 

lie Gla Phe His Trp Lys Asa Ser Asa Gin He Lys lie Leu Gly Asa 

50 55 60 

Gin Gly Ser Phe Leu Thr Lys Gly Pro Ser Lys Leu Asa Asp Arg Ala 
65 70 75 80 

Asp Ser Arg Arg Ser Leu Trp Asp Gla Gly Asa Phe Pro Leu He He 

85 90 95 

Lys Asa Leu Lys He Glu Asp Ser Asp Thr Tyr He Cys Glu Val Glu 

100 105 HO 

Asp Gla Lys Glu Glu Val Gin Leu Leu Val Phe Gly Leu Thr Ala Asa 

115 120 125 

Ser Asp Thr His Leu Leu Gla Gly Gla Ser Leu Thr Leu Thr Leu Glu 

130 135 140 

Ser Pro Pro Gly Ser Ser Pro Ser Val Gla Cys Arg Ser Pro Arg Gly 
145 150 155 160 

Lys Asa He Gla Gly Gly Lys Thr Leu Ser Val Ser Gla Leu Glu Leu 

165 170 175 

Gla Asp Ser Gly Thr Trp Thr Cys Thr Val Leu Gla Asa Gla Lys Lys 

180 185 190 

Val Glu Phe Lys He Asp He Val Val Leu Ala Phe Gla Lys Ala Ser 

195 200 205 

Ser He Val Tyr Lys Lys Glu Gly Glu Gla Val Glu Phe Ser Phe Pro 

210 215 220 

Leu Ala Phe Thr Val Glu Lys Leu Thr Gly Ser Gly Glu Leu Trp Trp 
225 230 235 240 

Gla Ala Glu Arg Ala Ser Ser Ser Lys Ser Trp He Thr Phe Asp Leu 

245 250 255 

Lys Asa Lys Glu Val Ser Val Lys Arg Val Thr Gla Asp Pro Lys Leu 

260 265 270 

Gla Met Gly Lys Lys Leu Pro Leu His Leu Thr Leu Pro Gla Ala Leu 

275 280 285 

Pro Gla Tyr Ala Gly Ser Gly Asa Leu Thr Leu Ala Leu Glu Ala Lys 

290 295 300 

Thr Gly Lys Leu Eis Gla Glu Val Asa Leu Val Val Met Arg Ala Thr 
305 310 315 320 

Gla Leu Gla Lys Asa Leu Thr Cys Glu Val Trp Gly Pro Thr Ser Pro 

325 330 335 

Lys Leu Met Leu Ser Leu Lys Leu Glu Asa Lys Glu Ala Lys Val Ser 

340 345 350 

Lys Arg Glu Lys Pro Val Trp Val Leu Asa Pro Glu Ala Gly Met Trp 

355 360 365 

Gla Cys Leu Leu Ser Asp Ser Gly Gla Val Leu Leu Glu Ser Asa He 

370 375 380 

Lys Val Leu Pro Thr Trp Ser Thr Pro Val His Ala Asp Pro Gla Leu 
385 390 395 400 

Cys Tyr He Leu Asp Ala He Leu Phe Leu Tyr Gly He Val Leu Thr 

405 410 415 

Leu Leu Tyr Cys Arg Leu Lys He Gla Val Arg Lys Ala Asp He Ala 

420 425 430 

Ser Arg Glu Lys Ser Asp Ala Val Tyr Thr Gly Leu Asa Thr Arg Asa 
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435 440 445 

Ola Olu Thr Tyr Glu Thr Leu Lys His Glu Lys Pro Pro Gla 
450 455 460 462 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 532 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Asn Arg Gly Val Pro Phe Arg His Leu Leu Leu Val Leu Gin Leu 

I S 10 15 

Ala Leu Leu Pro Ala Ala Thr Gin Gly Asn Lys Val Val Leu Gly Lys 

20 25 30 

Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gin Lys Lys Ser 

35 * 40 45 

lie Gin Phe His Trp Lys Asn Ser Asn Gin lie Lys He Leu Gly Asn 

50 55 60 

Gin Gly Ser Phe Leu Thr Lys Gly Pro Ser Lys Leu Asn Asp Arg Ala 
65 70 75 80 

Asp Ser Arg Arg Ser Leu Trp Asp Gin Gly Asn Phe Pro Leu He He 

85 90 95 

Lys Asn Leu Lys He Glu Asp Ser Asp Thr Tyr He Cys Glu Val Glu 

100 105 110 

Asp Gin Lys Glu Glu Val Gin Leu Leu Val Phe Gly Leu Thr Ala Asn 

115 120 125 

Ser Asp Thr His Leu Leu Gin Gly Gin Ser Leu Thr Leu Thr Leu Glu 

130 135 140 

Ser Pro Pro Gly Ser Ser Pro Ser Val Gin Cys Arg Ser Pro Arg Gly 
145 150 155 160 

Lys Asn He Gin Gly Gly Lys Thr Leu Ser Val Ser Gin Leu Glu Leu 

165 170 175 

Gin Asp Ser Gly Thr Trp Thr Cys Thr Val Leu Gin Asn Gin Lys Lys 

180 185 190 

Val Glu Phe Lys He Asp He Val Val Leu Ala Phe Gin Lys Ala Ser 

195 200 205 

Ser He Val Tyr Lys Lys Glu Gly Glu Gin Val Glu Phe Ser Phe Pro 

210 215 220 

Leu Ala Phe Thr Val Glu Lys Leu Thr Gly Ser Gly Glu Leu Trp Trp 
225 230 235 240 

Gin Ala Glu Arg Ala Ser Ser Ser Lys Ser Trp He Thr Phe Asp Leu 

245 250 255 

Lys Asn Lys Glu Val Ser Val Lys Arg Val Thr Gin Asp Pro Lys Leu 

260 265 270 

Gin Met. Gly Lys Lys Leu Pro Leu His Leu Thr Leu Pro Gin Ala Leu 

275 280 285 

Pro Gin Tyr Ala Gly Ser Gly Asn Leu Thr Leu Ala Leu Glu Ala Lys 

290 295 300 

Thr Gly Lys Leu His Gin Glu Val Asn Leu Val Val Met Arg Ala Thr 
305 310 315 320 

Gin Leu Gin Lys Asn Leu Thr Cys Glu Val Trp Gly Pro Thr Ser Pro 

325 330 335 

Lys Leu Met Leu Ser Leu Lys Leu Glu Asn Lys Glu Ala Lys Val Ser 

340 345 350 

Lys Arg Glu Lys Pro Val Trp Val Leu Asn Pro Glu Ala Gly Met Trp 
355 360 365 



Gin Cys Leu Leu Ser Asp Ser Gly Gin Val Leu Leu Glu Ser Asn He 

370 375 380 

Lys Val Leu Pro Thr Trp Ser Thr Pro Val His Ala Asp Pro Lys Leu 
385 390 395 400 

Cys Tyr Leu Leu Asp Gly He Leu Pbe He Tyr Gly Val He Leu Thr 

405 410 415 

Ala Leu Phe Leu Arg Val Lys Phe Ser Arg Ser Ala Glu Pro Pro Ala 

420 425 430 

Tyr Gin Gin Gly Gin Asn Gin Leu Tyr Asn Glu Leu Asn Leu Gly Arg 

435 440 445 

Arg Glu Glu Tyr Asp Val Leu Asp Lys Arg Arg Gly Arg Asp Pro Glu 

450 455 460 

Met Gly Gly Lys Pro Arg Arg Lys Asn Pro Gin Glu Gly Leu Tyr Asn 
465 470 475 480 

Glu Leu Gin Lys Asp Lys Met Ala Glu Ala Tyr Ser Glu He Gly Met 

485 490 495 

Lys Gly Glu Arg Arg Arg Gly Lys Gly His Asp Gly Leu Tyr Gin Gly 

500 505 510 

Leu Ser Thr Ala Thr Lys Asp Thr Tyr Asp Ala Leu His Met Gin Ala 

515 520 525 

Leu Pro Pro Arg 
530 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 33 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CGCGGGGTGA CCGTGCCCTC CAGCAGCTTG GGC 

<2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 8 J 

CGCGGGGATC CGTCGTCCAG AGCCCGTCCA GCTCCCCGTC CTGGGCCTCA 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LESGTH: 33 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



CGCGGGCGGC CGCGACGCCG GCCAAGACAG CAC 



(2) INFORMATION POR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



CGCGTTGACG AGCAGCCAGT TGGGCAGCAG CAG 



(2) INFORMATION POR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEHGTH: 15 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



CGCGGGCGGC CGCZX 



(2) INFORMATION POR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LE3G TH: 42 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULS TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



CGCGGGCTCG TTATXGAGCT GGTTCTGGCG CTGCTTCTTC 



(2) INFORMATION POR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 48 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



CGCGGGGAGC TCOTTATAOA OCTGOTTTGC COCCQAATTC TTATCCCO 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
. (A) LENGTH: 33 bases 

(B) TYPE: nucleic acid 

(C) STRAKDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



CGCGGGGCGG CCACGCGTCC TCGCCAGCAC ACA 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNSSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



CGCGGGACGC GTTTCAGCCG TCCTCGCCAG CACACA 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNSSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



CGCGGGACGC GTGACCCTGA GATGGGGGGA AAG 



(2) INFORMATION FOR SEQ ID NO: 17: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 17: 



COCGOGACOC GTATTGGGAT GAAAGGCGAG CGC 



(2) INFORMATION TOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



CCCGGATCCC AGCATGGGCA GCTCTT 



(2) INFORMATION TOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CGCGGGGCGG CCGCTTTAGT TATTACTGTT GACATGGTCG TT 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUEN CE CH ARACTERISTICS : 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



GCGGGGGGAT CCCACTGTCC AAGCTCCCAG 



33 



26 



42 



(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: singla 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



GCGGGGGCGG CCGCCTAAAT ACGGTTCTGG TC 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
" (A) LENGTH: 31 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



TCAGAAAGAG ACAACCTGAA GAAACCAACA A 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 1 
TTOTTOOTTT CTTCAGGTTO TGTCTTTCTO X 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID:NO:24: 

Mot Glu His Ser Thr Phe Leu Ser Gly Leu Val Leu Ala Thr Leu Leu 

5 10 is 

Ser Gin Val Ser Pro Phe Lys lie Pro lie Glu Glu Leu Glu Asp Arg 

20 - 25 30 

Val Phe Val Asn Cys Asn Thr Ser lie Thr Trp Val Glu Gly Thr Val 

35 40 45 

Gly Thr Leu Leu Ser Asp lie Thr Arg Leu Asp Leu Gly Lys Arg lie 

50 55 60 

Leu Asp Pro Arg Gly lie Tyr Arg Cys Asn Gly Thr Asp lie Tyr Lys 
65 70 75 80 

Asp Lys Glu Ser Thr Val Gin Val His Tyr Arg Met Cys Gin Ser Cys 

85 90 95 

Val Glu Leu Asp Pro Ala Thr Val Ala Gly lie lie Val Thr Asp Val 

100 v_ 105 HO 

Ala lie Thr Leu Leu Leu Ala Leu Gly Val Phe Cys Phe Ala Gly His 

115 120 125 

Glu Thr Gly Arg Leu Ser Gly Ala Ala Asp Thr Gin Ala Leu Leu Arg 

130 135 140 

Asn Asp Gin Val Tyr Gin Pro Leu Arg Asp Arg Asp Asp Ala Gin Tyr 
145 150 155 160 

Ser His Leu Gly Gly Asn Trp Ala Arg Asn Lys 
165 170 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids 

(B) TYPB: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Glu Gin Gly Lys Gly Leu Ala Val Leu lie Leu Ala lie lie Leu 

5 10 is 

Leu Gin Gly Thr Leu Ala Gin Ser lie Lys Gly Asn His Leu Val Lys 

20 25 30 

Val Tyr Asp Tyr Gin Glu Asp Gly Ser Val Leu Leu Thr Cys Asp Ala 

35 40 45 

Glu Ala Lys Asn He Thr Trp Phe Lys Asp Gly Lys Met He Gly Phe 
50 55 60 



% 



% 
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Leu Thr Glu Asp Lys Lys Lys Trp Asn Leu Gly Ser Asa Ala Lys Asp 
65 70 75 80 

Pro Arg Gly Met Tyr Gin Cys Lys Gly Ser Gin Asn Lys Ser Lys Pro 

85 90 95 

Leu Gin Val Tyr Tyr Arg Met Cys Gin Asn Cys He Glu Leu Asn Ala 

100 105 HO 

Ala Thr lie Ser Gly Phe Leu Phe Ala Glu He Val Ser He Pbe Val 

115 120 125 

Leu Ala Val Gly Val Tyr Phe He Ala Gly Gin Asp Gly Val Arg Gin 

130 135 140 

Ser Arg Ala Ser Asp Lys Gin Thr Leu Leu Pro Asn Asp Gin Leu Tyr 
145 150 155 160 

Gin Pro Leu Lys Asp Arg Glu Asp Asp Gin Tyr Ser His Leu Gin Gly 

165 170 175 

Asn Gin Leu Arg Arg Asn 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acids 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



Met Pro Gly Gly Leu Glu Ala Leu Arg Ala Leu Pro Leu Leu Leu Phe 

5 *- 10 15 

Leu Ser Tyr Ala Cys Leu Gly Pro Gly Cys Gin Ala Leu Arg Val Glu 

20 25 30 

Gly Gly Pro Pro Ser Leu Thr Val Asn Leu Gly Glu Glu Ala Arg Leu 

35 40 45 

Thr Cys Glu Asn Asn Gly Arg Asn Pro Asn He Thr Trp Trp Phe Ser 

50 55 60 

Leu Gin Ser Asn He Thr Trp Pro Pro Val Pro Leu Gly Pro Gly Gin 
65 70 75 80 

Gly Thr Thr Gly Gin Leu Phe Phe Pro Glu Val Asn Lys Asn Thr Gly 

85 90 95 

Ala Cys Thr Gly Cys Gin Val He Glu Asn Asn He Leu Lys Arg Ser 

100 105 HO 

Cys Gly Thr Tyr Leu Arg Val Arg Asn Pro Val Pro Arg Pro Phe Leu 

115 120 125 

Asp Met Gly Glu Gly Thr Lys Asn Arg He He Thr Ala Glu Gly He 

130 135 140 

He Leu Leu Phe Cys Ala Val Val Pro Gly Thr Leu Leu Leu Phe Arg 
145 150 155 160 

Lys Arg Trp Gin Asn Glu Lys Phe Gly Val Asp Met Pro Asp Asp Tyr 



Glu Asp Glu Asn Leu Tyr Glu Gly Leu Asn Leu Asp Asp Cys Ser Met 

180 185 190 

Tyr Glu Asp He Ser Arg Gly Leu Gin Gly Thr Tyr Gin Asp Val Gly 

195 200 205 

Asn Leu His He Gly Asp Ala Gin Leu Glu Lys Pro 



180 



165 



170 



175 



210 



215 



220 



(2) INFORMATION FOR SEQ ID NO: 27: 



% 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 amino acids 

(B) TYPEs amino acid 
(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: amino acid 

(xi) SEQUENCE DESCRIPTION: SSQ ID NO: 27 1 




Met Ala Thr Leu Val Leu Ser Ser Met Pro Cys His Trp Leu Leu Phe 

5 10 15 

Leu Leu Leu Leu Phe Ser Gly Glu Pro Val Pro Ala Met Thr Ser Ser 

20 25 30 

Asp Leu Pro Leu Asn Phe Gin Gly Ser Pro Cys Ser Gin lie Trp Gin 

35 40 45 

His Pro Arg Phe Ala Ala Lys Lys Arg Ser Ser Met Val Lys Phe His 

50 55 60 

Cys Tyr Thr Asn His Ser Gly Ala Leu Thr Trp Phe Arg Lys Arg Gly 
65 70 75 80 

Ser Gin Gin Pro Gin Glu Leu Val Ser Glu Glu Gly Arg lie Val Gin 

85 * 90 95 

Thr Gin Asn Gly Ser Val Tyr Thr Leu Thr lie Gin Asn lie Gin Tyr 

100 105 110 

Glu Asp Asn Gly lie Tyr Phe Cys Lys Gin Lys Cys Asp Ser Ala Asn 

115 120 125 

His Asn Val Thr Asp Ser Cys Gly Thr Glu Leu Leu Val Leu Gly Phe 

130 135 140 

Ser Thr Leu Asp Gin Leu Lys Arg Arg Asn Thr Leu Lys Asp Gly lis 
145 150 155 160 

He Leu He Gin Thr Leu Leu He He Leu Phe He He Val Pro Ha 

165 ^ 170 175 

Phe Leu Leu Leu Asp Lys Asp Asp Gly Lys Ala Gly Met Glu Glu Asp 

180 185 190 

His Thr Tyr Glu Gly Leu Asn He Asp Gin Thr Ala Thr Tyr Glu Asp 

195 200 205 

He Val Thr Leu Arg Thr Gly Glu Val Lys Trp Ser Val Gly Glu His 

210 215 220 

Pro Gly Gin Glu 
225 



